Abstract 

A statistical analysis of the prime numbers indicates possible traces of quantum chaos. We have computed the nearest 
neighbor spacing distribution, number variance, skewness, and excess for sequences of the first N primes for various 
values of N. All four statistical measures clearly show a transition from random matrix statistics at small N toward 
Poisson statistics at large N. In addition, the number variance saturates at large lengths as is common for eigenvalue 
sequences. This data can be given a physical interpretation if the primes are thought of as eigenvalues of a quantum 
system whose classical dynamics is chaotic at low energy but regular at high energy. We discuss some difficulties 
with this interpretation in an attempt to clarify what kind of physical system might have the primes as its quantum 
eigenvalues. 

1. Introduction 

One of the primary results to emerge from the study of the quantum dynamics of classically chaotic 
systems is the connection between eigenvalue statistics and classical dynamics [1,2,3]. Quantum systems 
with classically regular dynamics typically have eigenvalues that follow Poisson statistics [4]. Quantum 
systems with classically chaotic dynamics have eigenvalues that follow the statistics of random matrix 
eigenvalues [5,6]. Different ensembles of random matrices correspond to different symmetries of the system. 
The eigenvalues of time-reversible systems without spin-1/2 interactions follow the statistics of the Gaussian 
Orthogonal Ensemble (GOE), the eigenvalues of time-irreversible systems follow the statistics of the Gaussian 
Unitary Ensemble (GUE), and the eigenvalues of systems with spin-1/2 interactions follow the Gaussian 
Symplectic Ensemble (GSE). Generic quantum systems, which have mixed chaotic and regular classical 
dynamics, follow statistics that are intermediate between Poisson and random matrix statistics [7]. 

This surprising connection between quantum eigenvalues and random matrices has led to an even more 
surprising connection between quantum mechanics and number theory. The imaginary parts of the non- 
trivial zeros of the Riemann zeta function, a function of paramount importance in the theory of prime 
numbers, display GUE statistics to very high precision [8]. This finding led to speculation that the zeta 
zeros can be thought of as eigenvalues of a time-irreversible quantum system that is classically chaotic. 

We take a more direct approach to connecting prime numbers and quantum physics by applying to the 
primes themselves the same statistical analyses that are used to study eigenvalue sequences. Some attempts 
in this direction have already been published. Porter examined the nearest neighbor spacing distribution 
(NNSD), the distribution of differences between consecutive primes, for primes in the vicinity of 1.2 x 10* [9]. 
The resulting NNSD generally follows the expectation for Poisson statistics, but with two major deviations: 
large peaks at spacings equal to multiples of 6, and a shortage of spacings near zero. The first deviation 
can be explained by the fact that all primes greater than 3 are congruent to either 1 (mod 6) or -1 (mod 6). 
The second deviation can be partially attributed to the fact that, with one exception, all primes are odd 
integers and so the spacing between consecutive primes cannot be less than 2. Liboff and Wong also studied 
the NNSD of primes, but they examined primes in the range 2 to 10^ [10]. They came to quite a different 
conclusion from that of Porter, claiming that the primes seem to fit the GOE distribution rather than the 
Poisson. Their analysis has two problems: they did not unfold the primes (see Section 2) 
before calculating the NNSD, and the fit of their histograms to the GOE distribution is questionable. Their 
results are, however, sufficient to call into question the conclusion that the primes (at least the small primes) 
follow Poisson statistics. 
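
The two deviations described above are easy to check numerically. The following Python sketch is an illustration only, not the analysis of Porter or of this paper: the cutoff of 100000 and the helper names `primes_up_to` and `gap_counts` are assumptions chosen for the example. It tabulates the raw (not unfolded) gaps between consecutive primes and the residues of the primes modulo 6.

```python
from collections import Counter


def primes_up_to(limit):
    """Return all primes <= limit using the sieve of Eratosthenes."""
    sieve = bytearray([1]) * (limit + 1)
    sieve[0:2] = b"\x00\x00"
    for p in range(2, int(limit ** 0.5) + 1):
        if sieve[p]:
            # Cross out every multiple of p starting at p*p.
            sieve[p * p :: p] = bytearray(len(range(p * p, limit + 1, p)))
    return [n for n in range(limit + 1) if sieve[n]]


def gap_counts(primes):
    """Count how often each gap between consecutive primes occurs."""
    return Counter(b - a for a, b in zip(primes, primes[1:]))


primes = primes_up_to(100_000)            # illustrative cutoff (assumption)
large = [p for p in primes if p > 3]
residues = {p % 6 for p in large}         # 5 is the same class as -1 (mod 6)
counts = gap_counts(large)

print("residues mod 6:", sorted(residues))
for gap in sorted(counts)[:12]:
    print(gap, counts[gap])
```

Because every prime above 3 lies in one of only two residue classes, a gap that is a multiple of 6 keeps the residue class unchanged, which is consistent with the excess of such gaps in the histogram.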

In this paper we present a statistical study of both relatively large ($\approx 3 \times 10^{13}$) and small primes. Several 
statistical studies of prime numbers have been published recently [11,12,13,14,15], but these studies did 
not use the tools that are typically applied to quantum eigenvalue spectra. Using these tools can provide 
information on what kind of physical system might have the primes as its quantum eigenvalues because of 
the connection between eigenvalue statistics and classical dynamics. Our goals are to shed some light on this 
intriguing topic, clarify the conflicting results of Porter and Liboff and Wong, and add to the growing body 
of statistical data on the primes. 



2. Level spacings 



Once a particular subsequence of primes has been generated it must be unfolded before the statistical 
calculations are performed. If the primes are thought of as energy eigenvalues then unfolding accounts for 
the way the density of states changes with energy. The standard procedure for unfolding a sequence of 
eigenvalues $x_i$ is to insert each eigenvalue into the average level staircase function, $\bar{N}(x)$. The average level 
staircase function is a smooth function that gives the approximate number of eigenvalues less than $x$. For the 
primes the exact level staircase function is usually denoted by $\pi(x)$ and the average level staircase function 
is given by a smooth function that approximates $\pi(x)$. The celebrated Prime Number Theorem indicates 
that $\pi(x)$ is well-approximated by $x/\log(x)$ in the limit $x \to \infty$. However, a better approximation for $\pi(x)$ 
is given by the log integral function: 

$$\mathrm{Li}(x) = \int_2^x \frac{dt}{\log(t)}. \qquad (1)$$

For small primes an even better approximation for $\pi(x)$ is given by 

$$R(x) = \sum_{m=1}^{\lfloor \log(x)/\log(2) \rfloor} \frac{\mu(m)}{m}\, \mathrm{Li}(x^{1/m}), \qquad (2)$$

with the Moebius function $\mu(m)$ defined by 

$$\mu(m) = \begin{cases} 1, & \text{if } m = 1 \\ 0, & \text{if } m \text{ is divisible by a square of a prime} \\ (-1)^k, & \text{otherwise} \end{cases} \qquad (3)$$

where $k$ is the number of prime divisors of $m$. To unfold the primes we transform from the sequence of 
primes $p_n$ to the sequence $e_n = R(p_n)$. The sequence $e_n$ will then have a mean density of one throughout the 
sequence. Making the mean density uniform in this way allows us to more easily study fluctuations about 
the mean. 
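
As a minimal sketch of this unfolding step, the Python code below evaluates the log integral, the Moebius function, and the truncated sum for R(x), and then unfolds the first 1000 primes. It is an illustration written for this text and not the code used to produce the results reported here. The series used for the logarithmic integral, the function names, and the choice of 1000 primes are all assumptions made for the example.

```python
import math

EULER_GAMMA = 0.5772156649015329


def li(x):
    """Logarithmic integral for x > 1 from the series
    gamma + ln(ln x) + sum_k (ln x)^k / (k * k!)."""
    u = math.log(x)
    total = EULER_GAMMA + math.log(u)
    term = 1.0
    for k in range(1, 400):
        term *= u / k                      # term = u**k / k!
        total += term / k
        if term / k < 1e-16 * abs(total):
            break
    return total


LI_2 = li(2.0)


def Li(x):
    """Log integral with lower limit 2, as in Eq. (1)."""
    return li(x) - LI_2


def mobius(m):
    """Moebius function of Eq. (3), by trial division."""
    result, p = 1, 2
    while p * p <= m:
        if m % p == 0:
            m //= p
            if m % p == 0:                 # divisible by p squared
                return 0
            result = -result
        p += 1
    return -result if m > 1 else result


def riemann_r(x):
    """Truncated sum of Eq. (2), with upper limit floor(log x / log 2)."""
    m_max = int(math.floor(math.log(x) / math.log(2) + 1e-12))
    return sum(mobius(m) / m * Li(x ** (1.0 / m)) for m in range(1, m_max + 1))


def first_primes(count):
    """First `count` primes by trial division (fine for small counts)."""
    primes, n = [], 2
    while len(primes) < count:
        if all(n % p for p in primes if p * p <= n):
            primes.append(n)
        n += 1
    return primes


primes = first_primes(1000)
unfolded = [riemann_r(p) for p in primes]      # e_n = R(p_n)
mean_spacing = (unfolded[-1] - unfolded[0]) / (len(unfolded) - 1)
print("mean unfolded spacing:", mean_spacing)
```

The printed mean spacing should be close to one, which is the property the unfolding is meant to enforce.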

One of the most common ways to study these fluctuations is to examine the NNSD mentioned above. 
Figure 1 shows the NNSD for four different subsequences of primes, along with the curves for the Poisson 
distribution and the Wigner GOE distribution (which closely approximates the exact GOE distribution) 
given by 

$$P_{\mathrm{Poisson}}(s) = e^{-s} \quad \text{and} \quad P_{\mathrm{GOE}}(s) = \frac{\pi s}{2}\, e^{-\pi s^2/4}. \qquad (4)$$

Note that the most likely spacing for Poisson statistics is $s = 0$, while the most likely spacing for GOE 
statistics is $s = \sqrt{2/\pi}$. For this reason eigenvalues that follow Poisson statistics are said to exhibit level 
clustering while those that follow GOE (or any of the random matrix) statistics are said to exhibit level 
repulsion. Figure 1a shows the NNSD for the first 100 primes, which shows definite level repulsion and closely 
resembles GOE statistics. Increasing the number of primes in the subsequence, as in Figs. 1b and 1c, reduces 
the level repulsion and reveals a transition toward Poisson statistics. Fig. 1d shows the NNSD for the $10^6$ 
primes immediately following the $10^{12}$th prime. The histogram in Fig. 1d fits the Poisson distribution except 
for deviations similar to those observed by Porter. Overall these results suggest that small primes exhibit 
level repulsion and obey GOE statistics, while larger primes show a progressive trend toward level clustering 
and Poisson statistics. 
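
For readers who wish to reproduce a comparison of this kind, the sketch below codes the two reference curves of Eq. (4) and a normalized spacing histogram. It assumes the input is an already unfolded, sorted sequence. The bin width, the cutoff `s_max`, and the function names are illustrative assumptions and not the settings used for Fig. 1.

```python
import math


def p_poisson(s):
    """Poisson spacing density of Eq. (4)."""
    return math.exp(-s)


def p_goe(s):
    """Wigner GOE spacing density of Eq. (4)."""
    return 0.5 * math.pi * s * math.exp(-0.25 * math.pi * s * s)


def spacing_histogram(levels, bin_width=0.25, s_max=4.0):
    """Histogram of nearest neighbor spacings, normalized as a density.

    `levels` must be sorted and unfolded (mean spacing one).
    Spacings at or beyond s_max are dropped from the bins but still
    count toward the normalization.
    """
    spacings = [b - a for a, b in zip(levels, levels[1:])]
    n_bins = int(round(s_max / bin_width))
    counts = [0] * n_bins
    for s in spacings:
        k = int(s / bin_width)
        if 0 <= k < n_bins:
            counts[k] += 1
    norm = len(spacings) * bin_width
    return [c / norm for c in counts]


# Locate the most likely GOE spacing on a fine grid.
grid = [i * 0.001 for i in range(1, 4000)]
goe_mode = max(grid, key=p_goe)
print("GOE mode:", goe_mode, "expected:", math.sqrt(2.0 / math.pi))
```

The grid search for the maximum of the GOE curve gives a quick check of the most likely spacing quoted in the text.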

Berry and Robnik developed a statistical model that interpolates between GOE and Poisson statistics [7]. 
They considered an eigenvalue sequence that is a combination of one or more independent Poisson sequences 
and a single independent GOE sequence. The spacing distribution for such a sequence is given by 

$$P(s, \rho_1) = \rho_1^2\, e^{-\rho_1 s}\, \mathrm{erfc}\!\left(\sqrt{\pi}\, \bar{\rho} s/2\right) + \left(2 \rho_1 \bar{\rho} + \pi \bar{\rho}^3 s/2\right) \exp\!\left(-\rho_1 s - \pi \bar{\rho}^2 s^2/4\right) \qquad (5)$$

where $\mathrm{erfc}(x)$ is the complementary error function, $\rho_1$ is the fraction of the eigenvalues that come from any 
of the Poisson sequences, and $\bar{\rho} = 1 - \rho_1$ is the fraction of the eigenvalues that come from the GOE sequence. 
The solid curves in Fig. 1 show the Berry-Robnik distribution with the parameter $\rho_1$ determined by a fit to 
the number variance data (see Section 3) for each sequence. The curves fit the histograms reasonably well 
except for deviations similar to the deviations from Poisson statistics seen in Porter's work mentioned above. 
Note that the Berry-Robnik curve in Fig. 1a is indistinguishable from the GOE curve and the Berry-Robnik 
curve in Fig. 1d is very close to the Poisson curve. 
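
The Berry-Robnik spacing distribution is simple to evaluate directly. The sketch below is an illustration: the function name `berry_robnik` and the test value 0.5 for the Poisson fraction are assumptions. It codes Eq. (5) and checks numerically that the distribution is normalized and has unit mean spacing.

```python
import math


def berry_robnik(s, rho1):
    """Berry-Robnik spacing density, Eq. (5).

    rho1 is the Poisson fraction; the GOE fraction is 1 - rho1.
    """
    rho_bar = 1.0 - rho1
    gauss = math.exp(-rho1 * s - 0.25 * math.pi * rho_bar ** 2 * s * s)
    poisson_part = (rho1 ** 2 * math.exp(-rho1 * s)
                    * math.erfc(0.5 * math.sqrt(math.pi) * rho_bar * s))
    mixed_part = (2.0 * rho1 * rho_bar + 0.5 * math.pi * rho_bar ** 3 * s) * gauss
    return poisson_part + mixed_part


# Numerical normalization and mean for an intermediate case (rho1 = 0.5).
ds = 0.001
s_values = [(i + 0.5) * ds for i in range(40000)]      # midpoint rule on [0, 40]
norm = sum(berry_robnik(s, 0.5) for s in s_values) * ds
mean = sum(s * berry_robnik(s, 0.5) for s in s_values) * ds
print("norm:", norm, "mean spacing:", mean)
```

Setting `rho1 = 0` reduces the function to the Wigner GOE curve and `rho1 = 1` reduces it to the Poisson curve, which is the interpolation property used in the fits.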

3. Other Statistics 



The NNSD provides information about correlations between neighboring values in a sequence. To more 
closely examine the statistical properties of primes we can turn to other statistical measures that provide 
information about longer-range correlations, or correlations between more than just two values in the sequence. 
Three commonly used statistical measures are the number variance, skewness, and excess (or kurtosis). The 
number variance ($\Sigma^2$) measures two-point correlations over a longer range than the NNSD while the 
skewness ($\gamma_1$) and excess ($\gamma_2$) measure 3- and 4-point correlations, respectively. Each of these statistics can be 
defined in terms of the moments 

$$\mu_j = \langle (n - \langle n \rangle)^j \rangle \qquad (6)$$

where $n$ counts the number of values in an interval of length $L$ and $\langle \cdots \rangle$ represents an average taken over 
many such intervals throughout the entire sequence. In terms of these moments 

$$\Sigma^2 = \mu_2, \qquad \gamma_1 = \mu_3\, \mu_2^{-3/2}, \qquad \gamma_2 = \mu_4\, \mu_2^{-2} - 3. \qquad (7)$$

The expected results for these statistics in the Poisson, GOE, and GUE cases can be found in Chapter 16 
of Ref. [16]. 
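
The moment definitions translate directly into a short calculation. The sketch below is an illustration under stated assumptions: it uses nearly non-overlapping windows that tile the sequence, which is one of several possible averaging schemes and is not necessarily the one used for the figures. It is tested on a synthetic Poisson sequence generated with a fixed seed rather than on the primes.

```python
import bisect
import random


def count_moments(levels, L):
    """Mean and central moments mu_2, mu_3, mu_4 of the number of levels
    found in windows of length L placed along a sorted, unfolded sequence."""
    start, stop = levels[0], levels[-1] - L
    n_windows = max(1, int((stop - start) / L))
    step = (stop - start) / n_windows          # step >= L, so windows do not overlap
    counts = []
    for j in range(n_windows):
        a = start + j * step
        # Number of levels e with a <= e < a + L.
        counts.append(bisect.bisect_left(levels, a + L) - bisect.bisect_left(levels, a))
    mean = sum(counts) / len(counts)
    mu = [sum((c - mean) ** j for c in counts) / len(counts) for j in (2, 3, 4)]
    return mean, mu[0], mu[1], mu[2]


def spectral_statistics(levels, L):
    """Number variance, skewness, and excess from Eq. (7)."""
    _, mu2, mu3, mu4 = count_moments(levels, L)
    return mu2, mu3 / mu2 ** 1.5, mu4 / mu2 ** 2 - 3.0


# Synthetic Poisson sequence with unit mean spacing (seed fixed for repeatability).
random.seed(2024)
poisson_levels, t = [], 0.0
for _ in range(20000):
    t += random.expovariate(1.0)
    poisson_levels.append(t)

sigma2, gamma1, gamma2 = spectral_statistics(poisson_levels, 2.0)
print("Sigma^2:", sigma2, "gamma_1:", gamma1, "gamma_2:", gamma2)
```

For a Poisson sequence the window counts are Poisson distributed with mean L, so the number variance should come out near L and the skewness near $L^{-1/2}$. A perfectly rigid sequence gives zero number variance, in which case the skewness and excess are undefined.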

Figure 2 shows the number variance for the same subsequences of primes examined in Fig. 1 as well as 
the curves for Poisson, GOE, and GUE statistics. The data for the first 100 primes in Fig. 2a matches the 
GOE curve (with small deviations at larger values of L that are likely due to poor statistics since there are 
so few numbers in this sequence). The data for the first $10^4$ (Fig. 2b) and $10^6$ (Fig. 2c) primes show a shift 
away from the GOE curve and toward the Poisson curve as more primes are included in the subsequence. 
The data for the first $10^6$ primes after the $10^{12}$th prime (Fig. 2d) continues this trend, but still displays 
significant deviations from the Poisson curve. 

The solid curves in Fig. 2 are derived by fitting the data to the Berry-Robnik distribution mentioned in 
Section 2. The formula for the number variance in Berry-Robnik statistics is 

$$\Sigma^2_{\mathrm{BR}}(L, \rho_1) = \Sigma^2_{\mathrm{Poisson}}(\rho_1 L) + \Sigma^2_{\mathrm{GOE}}(\bar{\rho} L) \qquad (8)$$

where again it is assumed that a fraction $\rho_1$ of the eigenvalues in the sequence follow Poisson statistics while 
the remaining fraction ($\bar{\rho} = 1 - \rho_1$) follow GOE statistics. It is clear from Fig. 2 that the Berry-Robnik 
distribution fits the data quite well. Table 1 shows the values of $\rho_1$ determined by fitting the number variance 
data in the range $0 < L < 5$ to the Berry-Robnik distribution for several different subsequences of primes. 
These data clearly show a transition from GOE statistics for small primes toward Poisson statistics for larger 
primes. However, it should be noted that $\rho_1$ is still well below one even for primes near the $10^{12}$th prime. 
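
As a consistency check on the Berry-Robnik number variance, its two limits can be written out explicitly. The short derivation below uses only the standard Poisson result that the number variance equals the interval length and the fact that any number variance vanishes at zero length.

```latex
\begin{align*}
\Sigma^2_{\mathrm{Poisson}}(L) = L
  \;&\Rightarrow\;
  \Sigma^2_{\mathrm{BR}}(L,\rho_1)
  = \rho_1 L + \Sigma^2_{\mathrm{GOE}}\bigl((1-\rho_1)L\bigr),\\
\rho_1 = 0 &:\quad \Sigma^2_{\mathrm{BR}}(L,0) = \Sigma^2_{\mathrm{GOE}}(L),\\
\rho_1 = 1 &:\quad \Sigma^2_{\mathrm{BR}}(L,1) = L + \Sigma^2_{\mathrm{GOE}}(0) = L .
\end{align*}
```

The fitted Poisson fraction therefore sets the slope of the linear part of the curve, while the GOE term contributes only a slowly growing correction.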

Higher-order correlations follow a pattern similar to that seen in the number variance. Figure 3 shows the 
skewness as a function of L for the same sequences examined in Figures 1 and 2. Figure 4 shows the excess 
for these sequences. The curves in both of these figures show the expected results for Poisson, GOE, and 
GUE statistics. In both figures there is a transition from random matrix statistics for small primes toward 
Poisson statistics for larger primes. However, the skewness for the first 100 primes (Fig. 3a) seems to fall 
somewhere between the GOE and GUE curves and the excess for the first 100 primes (Fig. 4a) appears to fit 
the GUE curve better than the GOE curve. It is difficult to determine the significance of the skewness and 
excess results for the first 100 primes because this data shows large fluctuations due to the small number 
of values in the sequence. The skewness and excess curves for the Berry-Robnik distribution using the $\rho_1$ 
values from Table 1 do not fit the data in Figs. 3 and 4, and we were unable to obtain an acceptable fit to 
these data using any value for $\rho_1$. This could be an indication that the primes are not really a combination of 
independent Poisson and GOE sequences, or it could indicate that more than one GOE sequence is involved. 
We attempted to fit the skewness and excess data to a Berry-Robnik distribution for a mixture of Poisson 
and GUE sequences and for a mixture of Poisson and two GOE sequences without any success. 

To further clarify whether or not the first 100 primes really follow GOE statistics we also examined 
the statistics of the first 50 alternate primes (2, 5, 11, . . . , 523). If every other number in a GOE sequence 
is removed, the resulting sequence is known to follow GSE statistics. We generated a list of the first 50 
alternate primes, unfolded this sequence using $R(x)$, and rescaled the resulting sequence so that the mean 
spacing was one. We then examined the level spacing distribution, number variance, skewness, and excess 
as described above. The results are presented in Figure 5. The solid curves in Figure 5 show the expected 
results for the GSE distribution. The Wigner GSE level spacing distribution is given by 

$$P_{\mathrm{GSE}}(s) = \frac{2^{18}}{3^6 \pi^3}\, s^4\, e^{-64 s^2/(9\pi)} \qquad (9)$$

and the expected results for $\Sigma^2$, $\gamma_1$, and $\gamma_2$ for GSE statistics can be found in Chapter 16 of Ref. [16]. The 
fit between the data and the GSE results is fairly good in all cases, although there are noticeable differences 
that may simply be fluctuations that result from the small number of values in the sequence. Overall, the 
statistics of the first 50 alternate primes adds support to the claim that the first 100 primes follow GOE 
statistics. 
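
The alternate-level construction can be sketched in a few lines. The code below is an illustration only: it assumes the input is an already unfolded sequence (taking alternate values of the unfolded primes selects the same numbers as unfolding the alternate primes), and the function names are hypothetical. It also checks numerically that the Wigner GSE curve is normalized with unit mean spacing.

```python
import math


def wigner_gse(s):
    """Wigner surmise for the GSE level spacing density."""
    prefactor = 2 ** 18 / (3 ** 6 * math.pi ** 3)
    return prefactor * s ** 4 * math.exp(-64.0 * s * s / (9.0 * math.pi))


def alternate_levels(levels):
    """Keep the 1st, 3rd, 5th, ... levels and rescale to unit mean spacing."""
    kept = levels[::2]
    mean_spacing = (kept[-1] - kept[0]) / (len(kept) - 1)
    return [(x - kept[0]) / mean_spacing for x in kept]


# Midpoint-rule check of normalization and mean spacing of the GSE curve.
ds = 0.001
s_values = [(i + 0.5) * ds for i in range(8000)]
gse_norm = sum(wigner_gse(s) for s in s_values) * ds
gse_mean = sum(s * wigner_gse(s) for s in s_values) * ds
print("norm:", gse_norm, "mean spacing:", gse_mean)

# Toy input: 100 equally spaced levels become 50 levels with spacing one.
rescaled = alternate_levels([float(i) for i in range(100)])
```

The rescaling step is needed because dropping every other value doubles the mean spacing of a sequence that was unfolded to unit density.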

4. Interpretation of the results 

These statistical results can be given a physical interpretation because of the connection between classical 
dynamics and eigenvalue statistics discussed in Section 1. Numerical studies of several model systems have 
shown similar transitions between random matrix and Poisson statistics [17,18]. In these cases the transition 
in statistics occurs when a parameter is changed in such a way as to change the classical dynamics of the 
system from chaotic to regular. In the case of the primes the transition occurs as one moves toward larger 
and larger primes. If the primes are thought of as energy eigenvalues of a quantum system, then the classical 
dynamics of that system would seem to be chaotic at low energies and increasingly regular at higher energies. 
There are, however, some problems with this interpretation which we will discuss below. 

One potential problem is that the deviation from Poisson statistics might be due to improper unfolding. 
The use of some local unfolding procedures on Poisson sequences has been shown to produce apparent 
deviations from Poisson statistics [19]. However, the unfolding procedure we have used is a global unfolding 
based on the function $R(x)$, which is known to accurately approximate the mean level staircase function for 
the primes. We have also tried unfolding the sequence of primes using $x/\log(x)$ and $\mathrm{Li}(x)$ and there is no 
substantial change in the statistical results, so we are confident that our unfolding procedure is correct. 

Another alternative to our suggested interpretation is that the primes may be eigenvalues of an integrable 
system and the deviations from Poisson statistics at low energies may be due to the influence of short 
periodic orbits in the corresponding classical system. Berry has shown, using semiclassical arguments, that the 
spectral rigidity (a statistic closely related to the number variance) will saturate at a value of L determined 
by the period of the shortest periodic orbit of the classical system [20]. This effect can lead to large 
deviations from Poisson statistics in integrable systems, even mimicking GOE statistics, at low energies while at 
higher energies the statistics approach the Poisson curves [21,22]. The usual pattern in these cases is that 
the spectral rigidity (or the number variance) follows the Poisson curve up to $L \approx L_{\max}$, then saturates and 
remains at a roughly fixed value as L is increased beyond $L_{\max}$. Since $L_{\max}$ increases with energy the overall 
result is a transition toward Poisson statistics at higher energies. In the primes, however, we see deviations 
at small L even when the energies are quite large (as shown in Fig. 2d). In fact, the number variance for 
the primes does saturate but at values of L much larger than those shown in Fig. 2. Figure 6 shows that 
this saturation of the number variance occurs for sequences of primes and that the value of L at which the 
number variance saturates increases as the size of the primes in the sequence increases. 

There are other issues that complicate the association between eigenvalue statistics and classical dynamics. 
Large deviations from Poisson statistics have been found in pseudointegrable systems [23,24]. In addition, 
fully chaotic systems can display deviations from random matrix statistics due to the presence of localized 
quantum states associated with classical structures like cantori [25] or a continuous line of periodic orbits 
[26]. 

One fact that seems to contradict the suggested interpretation of the prime statistics is that if the primes 
are eigenvalues of a quantum system then that system must be one-dimensional, as pointed out by Mussardo 
[27]. This conclusion follows from the fact that the primes grow roughly like $n \log(n)$. Eigenvalues of simple 
quantum wells with potentials of the form $V(x) = |x|^k$ in $d$ dimensions grow like $n^{2k/(d(k+2))}$ according to 
Weyl's law [2]. This means the steepest possible growth is for a billiard system with hard walls (corresponding 
to $k \to \infty$), which gives $n^{2/d}$. So eigenvalues of systems with two (or more) spatial dimensions cannot grow 
faster than $n$. This result seems to rule out the possibility that primes are eigenvalues of a pseudointegrable 
or chaotic system since all time-independent 1D systems are integrable. However, simple one-dimensional 
quantum wells do not follow Poisson statistics but rather display the non-generic statistics characteristic of 
harmonic oscillators. It seems we are left with no options that fit the statistical data on the primes. 
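
The growth exponent quoted above follows from a phase-space volume count. The derivation below is a sketch with all dimensional constants set to one, where $\bar{N}(E)$ denotes the smoothed number of levels below energy $E$ and $\Theta$ is the unit step function.

```latex
\begin{align*}
\bar{N}(E) &\propto \int d^dx\, d^dp\; \Theta\!\left(E - p^2 - |x|^k\right)
  \propto E^{d/2}\, E^{d/k} = E^{\,d(k+2)/(2k)},\\
\bar{N}(E_n) = n \;&\Rightarrow\; E_n \propto n^{2k/(d(k+2))}
  \;\longrightarrow\; n^{2/d} \quad (k \to \infty),\\
d \ge 2 &:\quad \frac{2k}{d(k+2)} < \frac{2}{d} \le 1,\\
d = 1 &:\quad \frac{2k}{k+2} > 1 \iff k > 2 .
\end{align*}
```

Since the primes grow like $n \log(n)$, which is faster than $n$, only the one-dimensional case with a potential steeper than quadratic can keep up with them.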

We would like to suggest two alternatives that merit further research. The first is that the primes might 
be eigenvalues of a one-dimensional potential well that is more complicated than the simple wells discussed 
above. One such potential has already been suggested [27]. If this is the case then the apparent random 
matrix statistics for small primes has no relation to chaotic classical dynamics. We are unaware, though, 
of any one-dimensional system with eigenvalues that exhibit random matrix statistics at low energies and 
Poisson statistics at high energies. Another possibility to consider is that the primes could be eigenvalues 
of a one-dimensional potential well subject to a periodic driving field. Periodically-driven anharmonic wells 
can exhibit exactly the type of classical motion suggested by our statistical analysis, namely chaos at low 
energies and regular motion at higher energies (see Ref. [28] for an example). However, for time-periodic 
systems the appropriate eigenvalues are quasienergies rather than energies. Quasienergies are only defined 
modulo $\hbar\omega$, where $\omega = 2\pi/T$ and $T$ is the period of the time-dependent part of the system's Hamiltonian. 
Thus, numerical calculations produce quasienergies that are all within a single Brillouin zone between 0 and 
$\hbar\omega$. It is possible to assign each quasienergy to its proper Brillouin zone by tracing the evolution of each 
eigenvalue as the strength of the time-periodic perturbation is increased from zero, but it is not at all clear 
how this would affect the statistical properties of the sequence. 
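
The loss of ordering caused by the modulo operation can be seen in a toy example. The sketch below is purely illustrative: the unit choice $\hbar = 1$, the driving period, and the sample energies are assumptions and do not describe any system discussed here.

```python
import math

HBAR = 1.0  # illustrative units with hbar = 1 (assumption)


def fold_quasienergy(energy, period):
    """Map an energy into the first Brillouin zone [0, hbar * omega)."""
    omega = 2.0 * math.pi / period
    return energy % (HBAR * omega)


period = 2.0 * math.pi                     # gives hbar * omega = 1 in these units
energies = [0.5, 1.5, 7.5, -0.25]          # hypothetical unfolded levels
folded = [fold_quasienergy(e, period) for e in energies]
print(folded)
```

Levels that differ by a multiple of $\hbar\omega$ fold onto the same quasienergy, so the spacing statistics of the folded sequence need not resemble those of the original one.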

5. Conclusions 



We have analyzed subsequences of prime numbers using statistical tools used in the study of quantum 
eigenvalues. Our results clearly show level repulsion among the small primes with a progressive tendency 
toward Poisson statistics for larger primes. The statistical data suggest that primes could be eigenvalues of 
a quantum system whose classical counterpart is chaotic at low energies but increasingly regular at higher 
energies. There are, however, some difficulties with such an interpretation particularly because the primes 
would have to be eigenvalues of a one-dimensional system. 

In spite of the difficulties involved in their interpretation, we feel that these results strongly suggest a 
connection between primes and eigenvalues of a quantum system. These statistical results may also be of 
interest from a purely mathematical perspective. The deviation from Poisson statistics for small primes 
might be related to other "atypical" behaviors exhibited by small primes. For example, for sequences of 
the first n primes the number of primes congruent to 1 (mod 6) is always less than or equal to the number of 
primes congruent to -1 (mod 6) until the primes reach roughly $6 \times 10^{11}$ (a phenomenon known as Chebyshev's bias), even though 
the infinite sequence contains asymptotically equal proportions of both types of primes. A more dramatic example is that 
$\mathrm{Li}(x) > \pi(x)$ for x < 10^"'^^, even though Littlewood proved that these two functions must cross 
each other an infinite number of times. Our results seem to indicate a gradual convergence toward Poisson 
statistics as more primes are included in the sequence. It may be that the infinite sequence of (unfolded) 
primes follows Poisson statistics, even though primes near the $10^{12}$th prime show significant deviations from 
Poisson statistics. However, we are unaware of any rigorous proof that the full sequence of unfolded primes 
follows Poisson statistics. 

6. Figures 




Fig. 1. Nearest neighbor spacing distributions (NNSD) for subsequences of unfolded primes. The first three histograms show 
the NNSD for the first (a) $10^2$, (b) $10^4$, and (c) $10^6$ primes. The fourth histogram (d) shows the NNSD for the first $10^6$ primes 
after the $10^{12}$th prime. The curves show the Poisson (dashed) and GOE (dotted) distributions as well as the Berry-Robnik 
distribution (solid) obtained by fitting the number variance data (see Fig. 2). 

Fig. 2. Number variance $\Sigma^2$ as a function of interval length L for subsequences of unfolded primes. The open circles show the 
data for the first (a) $10^2$, (b) $10^4$, and (c) $10^6$ primes as well as for (d) the first $10^6$ primes after the $10^{12}$th prime. The curves 
show the expected results for Poisson (dashed), GOE (dotted), and GUE (dot-dashed) statistics as well as the Berry-Robnik 
distribution (solid) that best fits the data. Note the varying scales on the $\Sigma^2$-axis. 

Fig. 3. Skewness $\gamma_1$ as a function of interval length L for subsequences of unfolded primes. The open circles show the data for 
the first (a) $10^2$, (b) $10^4$, and (c) $10^6$ primes as well as for (d) the first $10^6$ primes after the $10^{12}$th prime. The curves show the 
expected results for Poisson (dashed), GOE (dotted), and GUE (dot-dashed) statistics. 

Fig. 4. Excess (kurtosis) $\gamma_2$ as a function of interval length L for subsequences of unfolded primes. The open circles show the 
data for the first (a) $10^2$, (b) $10^4$, and (c) $10^6$ primes as well as for (d) the first $10^6$ primes after the $10^{12}$th prime. The curves 
show the expected results for Poisson (dashed), GOE (dotted), and GUE (dot-dashed) statistics. 

7. Tables 

    First n primes    $\rho_1$      First $10^6$ primes after k    $\rho_1$
    --------------    ---------     ---------------------------    ---------
    n = 10^2          -0.00181      k = 10^7                       0.555921
    n = 10^3          0.239504      k = 10^8                       0.585383
    n = 10^4          0.328879      k = 10^9                       0.61471
    n = 10^5          0.430437      k = 10^10                      0.633034
    n = 10^6          0.489928      k = 10^11                      0.652538
                                    k = 10^12                      0.668721

Table 1 

Values of $\rho_1$, the fit parameter for the Berry-Robnik distribution, obtained by fitting the number variance data for various 
sequences of primes. The left two columns show the results for sequences of primes beginning with the first primes and having 
various lengths. The right two columns show the results for sequences of one million primes beginning at different starting 
values. 